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ABSTRACT 


It  is  well  known  that  if  . . a  are  positive 

n  y  1/r 

numbers,  then  the  mean  (la^/n)  is  increasing  in  r. 

In  this  paper  we  obtain  conditions  for  monotonicity  of  the 
ratio  ( £a*/Eb^) of  means. 

Monotonicity  of  a  ratio  can  be  viewed  as  a  form  of  total 
positivity.  The  theory  of  total  positivity  is  exploited  to 
obtain  more  general  results. 

The  proof  of  monotonicity  is  based  on  a  theorem  giving 
sufficient  conditions  for  majorization.  Several  other  appli¬ 
cations  of  the  majorization  theorem  are  given.  One  applica¬ 
tion  concerns  a  stochastic  comparison  between  a  function  of 
order  statistics  from  a  distribution  with  increasing  failure 
rate  average  and  the  same  function  of  the  order  statistics 
from  the  exponential  distribution.  Another  application  is 
to  a  comparison  between  the  condition  number  of  a  positive 
definite  matrix  and  the  condition  number  of  a  polynomial  in 


the  matrix. 
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1.  Introduction. 


It  is  well  known  that  if  a^,  . ..,  a^  are  positive 


numbers,  then  the  mean 


(?a> 


l/r 


is  increasing  in  r.  In  this 


paper,  we  first  obtain  conditions  for  monotonicity  of  the  ratio 


n  n 

i  •;/ 1  "i 

11 


l/r 


of  means  (and  for  its  continuous  version).  The 


conditions  for  monotonicity  permit  =  . . .  =  b^  =  1,  so  that 
Ib^  =  n,  and  thus  the  classical  result  becomes  a  special  case. 


Since  monotonicity  of  a  ratio  can  be  viewed  as  a  form  of 
total  positivity,  the  theory  of  total  positivity  is  exploited 
(Section  3)  to  obtain  more  general  versions  of  the  result  above. 


The  proof  of  monotonicity  is  based  on  a  theorem  which  gives 
sufficient  conditions  for  majorization .  Because  majorization  leads 
to  many  different  forms  of  inequalities  (see  Hardy,  Littlewood, 
and  Pcflya  (1932)>  and  Beckenbach  and  Bellman  (1961)),  it  is  not 
surprising  that  a  diversity  of  applications  of  the  majorization 
result  are  obtained. 


One  application  concerns  a  stochastic  comparison  between  a 
function  of  order  statistics  from  a  distribution  with  increasing  failure  rate 
average  and  the  same  function  of  the  order  statistics  from  the  exponential 
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distribution.  The  use  of  these  stochastic  comparisons  in  testing  certain 

statistical  hypotheses  is  pointed  out  (Section  5)* 

Marshall  and  Olkin  (1965)  compare  the  condition  number  of  a  matrix 

* 

A  with  the  condition  number  of  its  symmetrized  version  AA  .  The 

majorization  result  is  applied  (Section  6)  to  obtain  a  comparison  between 

the  condition  number  of  a  positive  definite  matrix  and  the  condition 

number  of  a  polynomial  in  the  matrix. 

In  the  final  application,  several  inequalities  concerning  absolute 

deviations  are  obtained  (Section  7)  which  generalize  known  results. 

Remark.  To  avoid  awkward  notation  we  often  omit  subscripts  in  sums 

n 

or  in  vectors,  e.g.,  2x  log  x  =  2  x .  log  x. ,  and 

1  1  1 


a 


2a 


v  r 

f  'i 


2 .  Majorization. 

To  determine  conditions  on  (a, ,...,a  )  and  (b,,...,b  )  such  that 

'  1  7  n  '  l7  n7 

i*  /  r  X./  r  f 

g(r)  s  (2a  /2b  )  '  is  nondecreasing  in  r,  one  might  set  d  log  g(r)/dr  >  0, 

or  equivalently, 

<■-  ns  -  a  1  -  (si  • 

Inequalities  of  this  type  can  be  obtained  under  appropriate  condi¬ 
tions  using  majorization. 

k  k 

Definition  2.1.  If  a^  >  . . .  >  an,  >  . . .  >  bn,  £  a^  >  £  b^ 

for  k  =  1,  2,  ...,  n  -  1,  and  £  a.  =  £  b  ,  then  a  =  (a.,..., a  )  is 

^  J  ^  J  ->  n 

said  to  majorize  b  =  (b^,  ...,bn),  written  a  >-b. 


Remark.  An  alternate  definition  of  majorization  is  sometimes  used: 
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A  set  (a)  is  said  to  majoriz<_-  a  set  (b)  if,  possibly  after  ordering 
cf  the  elements  of  each  set,  the  conditions  of  Definition  2.1  are 
satisfied  (Hardy,  Littlewood,  and  Pdlya  (1952),  page  +5)  • 

Definition  2.2.  A  real  function  9  of  n  real  variables  is  said 
to  be  a  Schur  function  if  for  every  pair  1  t  J,  (vxj>  (|r  -§tj  -  °- 
These  concepts  are  linked  in  the  basic  theorem: 

Theorem  2 o .  (Schur  (1923),  Ostrovski  (1952))*  Let  <p(x)  be  defined 

for  x  >  ...  >  x  .  cp(e)  >rp(b)  for  all  avb  if  and  only  if  <p  is  a  Schur  function. 

x  n 

Remark.  If  the  alternate  definition  of  majorization  is  used  (see 
the  remark  after  Definition  2.1),  then  the  requirement  that  q>  be 
symmetric  must  be  imposed. 

Nov  Zx  log  x  is  a  Schur  function  since  all  functions  of  the  form 

^  g(xi)>  g  convex,  are  Schur  functions.  Thus  to  prove  (2.1)  it  is 

sufficient  that  (a/Zct)  y  (p/Z0),  where  0^  =  a^  and  p^  =  b^,  i  =  1,  ...,n. 

Theorem  2.4.  If  a  >  0,  ...,  a  >0,  (3,  >  . . .  >  p  >0,  and 
-  1  n  1  —  -  n 

~  <  ...  •'  >  then  (—■-)  y  (t^t)  . 

a,  -  -  a  vZcr  r  vz,p' 

1  n 

Proof .  Note  that  the  hypotheses  imply  >  ...  >  >  0.  Thus, 

ve  must  prove  that  for  k  =  1,  ...,  n  -  1, 


k  n  k  n 

I V  £  aL  >  Z  V  Z  p,  • 

1  J  1  1  1  J  1  * 


This  follows  from 


n 


k  n  k  n  k  n  k 

Z Q, Z  -  Z Z>.  =  Z a,  Z  B,  -  Z s,  Z  a, 


1  J  T  1 


1  J  T  1 


t  t-  "  0 
1  J  k+1 


1  ^  "i 
1  J  k+1  6 


k  n  ii  p 

Y  f  a. a -4  >  C. 

jti  *  =k+i  J 1  at  "y 
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Using  Theorems  2.3  and  2.4  with  (2.1),  we  have  thus  d -rived  (for  r  >  0) : 

Theorem  2 . 5«  If  a^  >  0,  . . . ,  a^  >  0,  b^  >  . . .  >  b^  >  0, 
b  b  .  / 

—  v.  .  .  .  <  —  ,  then  (Zar/Zbr)  'r  is  increasing  in  r. 

a.  -  -  a 

1  t. 

A  more  direct,  though  possibly  less  clearly  motivated,  proof  of 
Theorem  2.5  is  based  on  an  application  of  Theorem  2.3  to  the  Schur 
function  qp(x^, . . .  ,xn)  =  Zx^,  t  >  1.  Again  for  r  >  0,  let  =  a.^ 

and  0  =  b^>  for  r  <  0,  let  0-j.  =  bn-i+i*  111611  by 

Theorems  2.3  and  2.4,  Z(ar/Zar)^  >  Z(br/Zbr)^.  The  conclusion  for 

r  >  0  (r  <  0)  is  a  consequence  of  choosing  t  =  s/r,  s  >  r  (s  <  r). 

r  /  i*  r 

Finally,  monotonicity  in  r  follows  from  the  continuity  of  (Za  /Zb  )  ' 
at  r  =  0. 

An  important  example  in  which  the  conditions  of  Theorem  2.4  are 
satisfied  is  obtained  by  choosing  =  \|r(0^),  where  \Jr  is  a  non-negative 
starshaped  function.  (A  real  function  \jr  defined  on  [0,»)  is  said  to  be 
starshaped  if  \Jr(x)/x  is  increasing  in  x.  An  interesting  example  of 
starshapedness  is  convex  on  [0,*),  t(0)  <  0.  Note  that  a  non-negative 

starshaped  function  must  be  increasing  and  must  satisfy  \Jf(o)  =  0. 

Such  functions  are  discussed  by  Bruckner  and  Ostrow  (1962).)  With  this 
choice,  assuming  0^  >  ...  >  0^  >  0,  it  follows  that  >  ...  >  >  0, 

so  that  by  Theorem  2.4, 

'Z0  ^  Z^f(0j  “  'Zcr  * 

5 .  An  Extension  Using  Total  Positivity. 

Theorems  2.4  and  2.5  can  be  viewed  as  theorems  on  total  positivity 
and  can  !e  proved  by  the  methods  of  total  positivity.  A  matrix  (t.  .) 
of  non-negative  numbers  is  said  to  be  totally  positive  of  order  2 
(TPg)  if  all  its  2x2  minors  are  non-negative.  Similar ly,  a  non-negative 


1 


I 


t  ■ 
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function  t(x,y)  of  real  variables  x  belonging  to  36  and  y  belonging 
to  y  is  said  to  be  totally  positive  of  order  2  if,  for  all  x^  <  x^ 
i..  and  y^  •  y„  ir.  ,  the  determinant 

/t(x1,y1)  t(xL,y2)\ 


det 


yt(x2,yL)  t(x2,y2) 


>  0  . 


Suppose  the  t^  are  non-negative;  then  (t^)  is  clearly  TP2  if 

and  only  if  t.  /t  is  increasing  in  j  for  each  i. 

1  +  ljJ  Ij 

Let  t  ^  a  >  0,  tg ,  =  0 .  >  0,  J  =  1,  2,  n;  then 


increasing  in  i  is  equivalent  to  (t  )  being  TPp.  Since  it  is 

J  k  £  k  .  I 


eas 


ily  concluded  from  Theorem  2.4  that  Za./  2  a.  >£  P./  Z  P.  for 

1  1  1  J  ~  1  1  1  *3 


all  k  <  i,  Theorem  2.4  essentially  states  that  if  (t  )  is  TPp 
k  *  J  ^ 

ar-d  hk E  ^  hr  then  'V  13  TV Wrlte  hk  ■  Vjk* Khere 

is  1  for  j  <  k  and  is  0  for  j  >  k.  Since  (Hjk)  is  TPg, 
Theorem  2.4  follows  from  a  standard  theorm  of  total  positivity  which 


asserts  that 


p(x,z)  =  f(p1  (x,y)  q>2(y,z)  dn(y) 

is  TPg  provided  <p^(x,y)  and  cp2(y,z)  are  eac^  TPg. 

Because  mor.otonicity  of  ratios  can  be  interpreted  as  total 
positivity  of  order  2,  Theorem  2.5  also  is  a  statement  of  total 
positivity.  This  suggests  the  following  generalization: 

Theorem  3  •  Let  g(x,y)  be  TPg  and  decreasing  in  y  for  each 
x .  Then 


is 


h(x,s)  h  {/g1//o(x,y)  da(y)  }S 


(ywJfKS  -  s'; 


r 


\V: 


Proof.  Define 


u(x,s)  =  log  h(x,s )  =  s  log  f gS(x,y)  d^i(y)  , 


then 


-  s 


1 

r  s  . 

g  du 


6^u(x,s ) 
c^x  6s 


I  1.!  I  ,  I 

r  o  p  s  ^  p  s  P  s 

g  d4  '  g  exlo6  g  du  -  /  g  g^dn  /  g°log  g  du 


=  det 


rgSdu 


r  ^  a 

'  g  8xdn 


i-1 


P  „  r  G 

i  ;  c“iog  g  dM.  g"  gxL°g  g  d^j 

I 


!i 


r 


det 


yl<y2 


--1 

s 


g"(x,yL)  g  (x,yL)  gx(x,yL) 


1  s’1 

E3(x,y2)  gS  (x,yg)  gx(x,y2) 


det 


1  log  g(x,yL) 


1  log  g(x,y2) 


dn(yL) 


from  Problem  68,  Po'lya  and  Szego  (1925)*  The  first  determinant  of  the 
integrand  is  non-negative  since  g(x,y)  is  TPg.  The  second  determinant 
is  non-positive  since  g(x,y)  is  decreasing  in  y  for  each  x.  The 
result  follows  from  the  fact  that  a  non-negative  function  h(x,s)  is 
TP^  3  and  on  ly  if  d2log  h(x,s)/6x  6s  >  0.  || 

To  obtain  Theorem  2.5  from  Theorem  5.1,  let  y  range  over  the 
values  1,  2,  ...,  n  and  choose  g(l,i)  =  a^  g(2,i)  =  b^  and  u(i)  =  l/n. 
W-_  car.  extend  Theorem  2.5  slightly  be  choosing  n(i)  =  p^  >0,  Zp^  =  1. 


du(y2) 


t 
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I*  1*  1  / 

Under  the  hypothecs  of  Theorem  2-5  we  conclude  that  (Zpa  /Zpb  )  '  is 
increasing  in  r  .  A  further  extension  is  obtained  in  the  next 

section. 

4 .  Monotonicity  of  Ratios  of  Means  . 

With  Theorem  2.5  in  mind,  consider  right  continuous  distributions 
G  and  F  of  non-negative  random  variables  approximated  by  step  functions 

with  jumps  l/n  at  a  and  b  respectively,  a  >  ...  >  a  >0, 

i  1  l  —  n 

b  >  ...  >  bn  >  0.  Then  G(a1)  =»  i/n  «  F^)  (where  F=l-F,  G=l-G), 
and  the  condition  b  /V  increasing  in  i  becomes  F~\i/n)/  G  -1(i/n) 
increasing  in  i .  In  the  limit,  we  obtain  the  condition  F  Hp)/G  "\p) 
increasing  in  p. 

To  avoid  the  necessity  of  assuming  that  the  distribution  functions 
are  strictly  increasing,  define 

H~\p)  =  inf (x  >  0:  H(x)  <  p},  h'^Cp)  =  inf{x  >  0:  H(x)  >  p}  , 

where  H  is  a  distribution  function. 

Theorem  4.1.  If  F  and  G  are  distribution  functions, 

F(0)  =  0  =  G(0)  ,  and  F_1(p)/G”1(p)  is  increasing  in  p,  then 

JJxr ac(x)  1/r 

/xrdj(x)' 

is  increasing  in  r. 

Proof .  Using  the  approximations  above,  a  limiting  argument  may  be 
used  to  obtain  the  result  from  Theorem  2.5*  Alternately,  it  may  be 
obtained  as  a  special  case  of  Theorem  3 .  1  by  choosing  g(l,y)  -  G~\y), 
g(2,y)  =  F_i(y)  and  taking  n  to  be  uniform  on  [0,1].  || 


8 


Whe:.  F(0  )  =  0  =  G(0  ),  F  and  G  are  continuous  and  have 
continuous  inverses,  the  hypothesis  of  Theorem  4.1  has  the  following 
equivalent  formulations : 

(i)  F~\p)/ G  ”^(p)  h  F~1(l-p)/G~1‘(l-p)  is  increasing  in  p, 

0  v  p  <  1, 

(ii)  G~^  F(x)/x  h  g  *T(x)/x  is  increasing  in  x  in  the  support 
of  F, 

(iii)  F(x)  =  G(^(x))  for  some  no.  -negative  starshaped  function  \|r, 

(iv)  If  X  is  a  random  variable  with  distribution  F,  then  <p(X) 
has  distribution  G  for  some  non-negative  star  shaped  t|r. 

We  say  that  F  is  starshaped  with  respect  to  G  if  (i),  (iii)  or 
(iv)  holds,  because  (i),  (iii)  and  (iv)  are  equivalent  even  without  the 
continuity  restrictions  on  F  and  G.  Since  the  equivalence  of  (iii) 
and  (iv)  is  easily  verified,  we  need  show  only  that  (i)  and  (iii)  are 
equivalent.  Let  J  be  the  set  of  all  x  such  that  x  =  inf  (z  :  H(z)  <  p) 

r 

for  some  pe(0,l). 

Theorem  4.2.  If  F  and  G  are  right  continuous  distribution 

functions  such  that  F(G  )  -  G(0  )  =  0,  then  F(x)  =  G(<p(x))  for  some 

strictly  increasing  function  rp  such  that  cp(x)/x  is  increasing  for  all 

x  if  and  only  if  F  ^  (p)/G  ^(p)  is  increasing  in  p,  0  <  p  <  1. 

r 

Proof.  Suppose  F(x)  =  G(rp(x))  for  some  non-negative  strictly 
increasing  rp  such  that  c p(x)/x  is  increasing  in  x  .  Then 

r 

F  "  ^(p)  _  inf (x:  F(x)  -  p)  _  inf(x:  F(x)  i  p)  _  inf(y:  F(y)  <  p)  _ 

G~1(p)  inf (x:  G(x)  •-  p)  inf (x:F(qp  i(x))  •-  p)  inf{<p(y):  F(y)  <  p) 


1 
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Nov  p  <  P  implies  y^  >  y_  „  .  Thus  p  <  p  implies 

F  ~1(p)  <  F~1(p*) 

G  _1(p)  “  G  ~\p*) 


,  ,  ,  ,  (  )  .  F  “L(p) 

yn/?(yD)  i  yv*/p(y  *)>  1,e*>  _-i 


Next,  suppose  F  *L(p)/ G_1(p)  is  increasing  in  p,  let  y  e^F 
and  define  qp(y)  =  G_1F(y)  =  inf{z:  G(z)  <F(y)}.  Then  since 

F  Lsl  -  F(z)  ^  pJ  _  — P  .  is  increasing  in  p,  it  follows 

G"  1(p)  inf {z:  G(z)  <_  p)  ^yp 

that  "p(y ) /y  is  increasing  in  y  aF.  Also  9(y)  is  strictly 
increasing  because  F(y)  is  strictly  decreasing  in  y  e^p.  il 

Some  choices  of  F  and  G  in  Theorem  4.1  are  of  special  interest. 

I.  Let  F  be  degenerate  at  1.  Then  F’Wg'^p)  =  l/G-\p) 
is  increasing  for  all  G.  In  this  case,  Theorem  4.1  reduces  to  the 

y »  T  / 

well-known  fact  that  [fx  dG(x)]  '  is  increasing  in  r  for  every  G. 

II.  Let  G(x)  =  e”X.  Then  the  condition  F"\p)/G  ^(p) 
increasing  in  p  becomes  -  x_1log  F(x)  increasing  in  x.  A  distribution 
F  with  this  property  is  said  to  have  an  increasing  failure  (hazard)  rate 
average  (IFRA) ,  since  in  case  F  has  a  density  f,  -  log  F(x)  =  j  q(u)du, 
where  q(u)  =  f(u)/  F(u)  is  the  failure  rate  of  F.  Similarly,  if 
x'Hog  F(x)  is  decreasing,  we  say  F  has  a  decreasing  failure  (hazard) 
rate  average  (DFRA).  The  class  of  IFRA  (DFRA)  distributions  has 
applications  in  reliability  theory  (see  Birnbaum,  Esary,  and  Marshall 
(1965)).  From  Theorem  4.1  we  see  that  if  F  is  IFRA  (DFRA), 
[/xrdF(x)/r(r+l)  ]1//r  is  decreasing  (increasing)  in  r>  0.  This  conclusion 
was  obtained  for  the  smaller  class  of  distributions  with  increasing 


(decreasing)  failure  rate  by  Barlow,  Marshall,  and  Proschan  (l9o3)» 
Properties  of  order  statistics  from  IFRA  (DFRA)  distributions 
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are  developed  in  Section  5* 

Choose  F(x)  =  e  X  and  G  a  DFRA  distribution  for  which 
/::rdC.(>;)  =  rr,r>0.  (A  proof  that  such  a  distribution  exists  has  been 
o':  tained  by  Herman  Rubin  and  is  presented  in  the  appendix.)  From 
T.-.esren  4.1  it  follows  that  [r  /f(r+l)  ]  '  is  an  increasing  function 
of  r>0,  a  result  obtained  by  Mine  and  Sathre  (1964). 

III.  Let  F(x)  =  G(xt),  t  >  1.  Then  G-1(p)  =  [F_1(p)]t  so 
that  F_1(p)/  G  (p)  =  [F  L(p)  }l  ^  is  increasing  in  p.  Thus,  the 
conditions  of  Theorem  4.1  are  satisfied,  and  so 

/xrdG(x) 

JxrdF(x) 

is  increasing  in  r.  More  generally,  if  X  has  distribution  F  and 
Y  =  t(X)  has  distribution  G  where  ^  >  0  is  starshaped,.  then 

(/Wx)]r  dF(x)//xrdF(x))1//r 

is  increasing  in  r. 
p .  Statistical  Applications. 

I:.  the  present  section  we  make  some  comparisons  involving  the  order 
statistics  from  distributions  F  and  G,  where  F  is  starshaped  with 
respect  to  G. 

Let  X,  >  . . .  >  X  be  order  statistics  from  F.  Then  from  (2.2) 
we  have  for  any  starshaped  >  0  that 


Consequently,  if  F  is  starshaped  with  respect  to  the  distribution 

G,  a:.d  X,  >  ...  >  X  are  order  statistics  from  F,  Y,  >  ...  >  Y  are 

1  -  —  n  1  -  -  n 


5 

ZX 


JL  r 

"v  > • • • >  V 


X 

J 

zx 


•< 


H\) 


\K(X  ) 
'  n 


H  1 

/  xr^dF(x) 

/xr  dF(x) 

m  m 
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order  statistics  from  G,  then 


X1 

X 

st 

/  Y. 

Y  1 

1 

n 

1 

n 

ZX  ’ *  *  * ' 

ZX 

!zy> 

ZY / 

k  n  k  n 

i.e.,  Y  X .  /  Y  X.  is  stochastically  less  than  or  equal  to  Y.  /  /  Y. 

i  T  1  i  i 

for  k  =  1,  . . .,  n.  (A  random  variable  U  is  stochastically  greater 

st 

than  or  equal  to  a  random  variable  V,  denoted  by  U  >  V  ,  if 

P[U  >  a]  >  P[V  >  a]  for  all  a.) 

With  (3*1) >  Theorem  2.3  yields  some  interesting  applications.  For 

-1  2 

example,  choosing  the  Schur  function  <p(z  ,  . . .  ,z^)  =  n  Zz  -  1,  we  obtain 

st 

(3*2)  I(X.-X)2/  X2  <  £(Yi-Y)2/Y2  . 

Choosing  the  Schur  function  <p(z,,...,z  )  =  nZaz/Zz,  a,  >  ...  >  a  , 

1  n  1  -  -  n 

z,  >  ...  >  z  ,  we  obtain 

1  -  —  n 

st 

(3-3)  Za.X./X  <  Za^./Y  , 

where  X  =  ZX/n,  Y  =  ZY/n. 

An  important  special  case  where  F  is  starshaped  with  respect  to 
G  (introduced  in  Section  4)  is  obtained  by  choosing  G(x)  =  e  and 
F  IFRA.  We  discuss  the  statistical  applications  of  (5*2)  and  (5*3)  in 
this  case,  although  the  results  hold  more  generally. 

I.  Consider  first  the  problem  of  testing  the  hypothesis  that  F  is 
exponential  versus  the  alternative  that  F  is  IFRA.  Because  of  the 
stochastic  ordering  given  by  (5*2),  a  test  based  on  the  statistic 
R(X)  =  Z(X^-X)  /X  is  unbiased.  (By  using  different  Schur  functions, 
alternative  statistics  could  be  used,  yielding  unbiased  tests.)  To 
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carry  out  the  test  at  the  level  of  significance  a,  determine  k  such 
that  P(R(X)  v  k  )  =a  when  F  is  exponential,  ana  reject  the  hypothesis 
of  exponentiality  when  R(X)  <  k  . 

For  a  test  of  the  hypothesis  that  F  is  exponential  versus  the 
alternative  that  F  has  increasing  failure  rate,  i.e.,  log  F(x)  is 
concave  where  finite,  see  Proschan  and  Pyke  (19^5)* 

II.  Next,  we  discuss  testing  for  outliers  when  the  distribution  F 
is  known  to  be  IFRA.  Suppose  that  X,  >  ...  >  X  are  order  statistics 
from  F  unless  possibly  that  X^  is  an  "outlier,"  i.e.,  X^  does  not 
arise  as  an  observation  from  F,  but  from  F^  <  F,  F^  ^  F. 

A  natural  test  of  the  hypothesis  that  X  >  . . .  >  X  all  come 

1  —  —  n 

from  F  is  to  reject  the  Hypothesis  if  X^/  X  is  too  large.  If  F 

is  unknown,  the  distribution  of  this  statistic  is  unavailable,  but  since 

F  is  IFRA,  it  follows  from  (9.3)  with  a^  =  1,  =  . . .  =  a^  =  0  that 

st 

X^/  X  <  Yj/Y,  where  Y^  >  ...  >  Yn  are  the  order  statistics  from  an 
exponential  distribution.  To  control  the  type  I  error,  note  that 
PlXj/X  >  kQ]  <  PiYL/ Y  >  ka)  =  a. 

Although  the  type  II  error  cannot  be  determined  without  knowing 
F^,  we  can  assert  that  the  type  II  error  is  smaller  under  F^  than 
under  F^  whenever  F^  is  starshaped  with  respect  to  F^. 

Similarly,  a  test  can  be  obtained  when  X  is  suspected  of  being 
a:,  outlier;  here  the  test  statistic  is  X  /  X,  and  we  use  (5*3)  with 

a.  -...  -  a  ,  =  0,  a  =  -  1  to  control  the  type  I  error  as  above. 

1  n-1  n 

"or,  generally,  a  test  against  the  possibility  that  ....  Xy  and 

X  ,  ...,  X  are  all  outliers  makes  use  of  (3*3)  with  a,  =  . . .  =  a  =1, 
s  r.  1  r 

a  _  ^  .  .  .  a  .  =  0,  a  =  . .  .  =  a  =  -  1.  The  test  statistic  is 
r+1  s-1  s  n 
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(X  +  ...  t  Xt 


"1 


.  -  )  /  x  . 

r 


.  Condition  I  .’umbers  . 


A  commonly  used  measure  of  the  difficulty  of  numerically  inverting 
a  non-singular  matrix  A  is  its  condition  number  c  (A)  =  'p(A)  <p(A  ^), 
where  ordinarily  (p  is  a  norm  (i.e.,  'p(A)  >  0  when  A  ^  0, 
cp(7A)  =  1 7 1  <p(A)  for  complex  y ,  *p(A+B)  <  'p(A)  +<p(B)).  Marshall  and 
Olkin  (1>j5)  show  that 


(o.l) 


%(A>  S  <=,,(***> 


when  (p  is  a  unitarily  invariant  norm  (i.e.,  cp(A)  =  Cp(UA)  =  rp(AU)  for 
all  unitary  matrices  U)  .  The  proof  is  based  in  part  on  a  result  of 
von  Neumann  (1937)  that  <p  is  a  unitarily  invariant  norm  if  and  only  if 

there  exists  a  symmetric  gauge  function  <J>  such  that  cp(A)  =  <I>(a)  for 

2  2  *  / 
all  A,  where  a.,  ...,  a  are  the  characteristic  roots  of  AA  .  (A 

'In 

function  $  on  a  complex  vector  space  is  called  a  symmetric  gauge  function 

(SGF)  if  $(u)  >  0  when  u  /  0,  C(/u)  -  |/j  cp(u)  for  complex  7, 

<t>(u+v)  <  <l>(u)  +  -f(v),  and  4>(u  , . ..,u  )  =  )  whenever 

n  1  11  n 

=  +  1  and  ji^,  ...,  ij  is  a  permutation  of  (l,  ...,n).)  Using  this, 
a  more  general  result  than  (6.1)  is  also  obtained  by  Marshall  and  Olkin 
(19°5)>  namely,  that  if  is  a  SGF  and  >  0,  then 


(o.2)  <t>(a^, . . . ,c/  )  $(a’r, . . .  jCf1 )  is  increasing  in  r  >  0  . 

The  proof  of  (o.2)  rests  on  the  following 

Lemma  6.1.  If  u  v,  $  is  a  SGF,  and  g  is  a  non-negative  convex 
function,  then. 


-r , 


(  -3) 


«(g(u))  <  *(e(v))  • 
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The  proof  of  (o.;)  was  given  by  Fan  (1951)  in  case  (i)  g(x)  =  x,  and 
ly  Marshall  and  Olkin  (l 905)  in  case  (ii)  g(x)  =  x  The  proof  of 

Lemma  6.1  parallels  that  of  (ii)  where  the  only  properties  of  x  ^  used 
are  its  non-negativity  and  convexity. 

We  note  that  every  SGF  is  a  Schur  function.  The  result  (6. 3)  does 
not  hold  for  Schur  functions  t  without  the  additional  condition  that 
g  is  increasing. 

In  Section  2  we  have  shown  that  if  \|r  >  0  is  starshaped  then 

( jr(P)/Z\Jr(P) )  )*(P/Zp).  In  the  following,  we  consider  non-negative 

starshaped  functions  of  the  form  t(P)  =  £c. P*^  where  c.  >  0.  From  (6.5) 

J  J 

with  g(x)  =  x  we  obtain  that  $(Pj/zp, . . . ,Pn/zp)  < 

®('Jf(Pj.)/2t(P)  >  •  •  •>'KPn)/£'lf(P))  •  From  (6.5)  with  g(x)  =  x"1  we  obtain 
that  «(ZP/PL,  ...,ZP/Pn)  <  4i(Z^(p)/t(P1),...,Z'|f(P)/t(Pn))*  By  multiplying 
these  inequalities  and  using  $(7x)  =  |  y |  ®(x),  we  obtain 

(6.1+)  «(P1,...,Pn)  <  •(Hr(P1),  -  --,Hr(Pn))  * 

r 1  n Z  '1  n  / 

The  special  case  \Jr  (P)  =  pS/^r,  0<r<s,  and  P^  =  o£,  i  =  1,  2,  . ..,  n, 

yields  (6.2). 

If  B  is  a  positive  definite  matrix  with  characteristic  roots 
^n’  ^en  ^he  characteristic  roots  of  i)r(B )  =  Zc^B^  are 
y(p^)  =  ZCjP^.  (For  any  positive  definite  matrix  C,  the  square  root 

*  V 

of  the  characteristic  roots  of  CC  are  the  characteristic  roots  of  C.) 
Hence,  in  terms  of  condition  numbers,  (6.4)  becomes 


(6*5) 


Vb>  <  y*(B» 


for  any  positive  definite  matrix  B. 
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For  a:.y  matrix  A  ai.d  any  unitarily  invariant  norm, 
c  ( A)  =  c  ((AA*)^).  With  B  =  (AA*)^  and  >Jr(B )  =  ZCjBJ,  it  follows 
from  (j.'3)  that 

(j.j)  c^(A)  <  Cp(\|f(AA  )^))  . 

2 

This  reduces  to  (o.l)  when  ijr(B)  =B  . 

{ .  Inequalities  for  Absolute  Deviations . 

n 

If  c^,  are  real  numbers  satisfying  £  c^  =0,  then 

(7.1)  &\c\S  <  (iz|c|  f  ,  S  >  1  . 


This  inequality  was  proved  by  Gatti  (1956)  for  s  =  2,  and  in  its  present 
form  by  Birnbaun  (1958)*  Generalizations  of  (7*1)  can  be  obtained 
using  Ti'.eorems  2.3  and  2.4. 


n 

Lemma  7.1-  If  c^,  c^  satisfy  £  ^  =  0,  u^  =  |  c^  |  , 


and 


u,  >  ...  >  u  ,  then 

1  -  -  tv 


(7-2) 


u 


u 


1  n 

Zu  *"'*  Zu 


.  .  .  ,0)  . 


n 


n 


Proof.  From  c  =-),c  we  have  u..  =  \)  c .  |  <  y  u  ,  so  that 
n  I21n  1  2  1  2  1  n 

2u^<:]T'u^,  and  consequently  u]/£Ui  <  i*  Of  course,  (u^+u^J/JI  u±  _  1. 


From  (2.2),  it  follows  that  for  0  <  r  <  1, 


(7.3) 


r  r  \  , 

u.  u  1  lu.  u 

1  n  |  j  |  1  n 

~  r  *  Zu  ,,,>  Zu 

L. U  Zu  /  ’ 


Since  <p(x  ,  ...,x  )  =  Zx*,  t  >  1,  is  a  Schur  function,  we  have  from 
Theorem  2.3  and  (7*2), 

r  it 


u 


— r  £  2  ^  , 

IZu  /  2t 


t  >  1  . 
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The  choice  t  =  s/r,  s  >  r,  yieldi 


(7.-0 


( ^Zur )  >  ( -|-Zu‘J )  ^ 


1,  r  < 


The  result  (7*1)  corresponds  to  the  case  r  =  1.  The  restriction 

0  <  r  <  1  cannot  be  relaxed  to  0  <  r,  as  may  be  seen  from  the  choice 

c.  =  1,  c_=  . .  .=  c  =  — Kr  and  r  >  1. 

1  2  n  n-1 

Remark.  If  in  Lemma  7 .  1  we  define  =  h(|c^|)  where  h  is 
non-negative  and  subadditive,  i.e.,  h(x+y)  <  h(x)  +  h(y),  then  (j  .2) 
remains  valid.  The  function  h(x)  =  x  ,  0  <  r  <  1,  leads  to  (7*4).  If 
h(x)  is  non-negative  and  concave  for  x  >  0,  with  h(o)  =  0,  then  h(x) 
is  subadditive  in  x  >  0. 

As  a  consequence  of  (7-1),  De  Novellis  (1958)  proves  that 


(f.5) 


Zu°  +  ^  <  4  Zu  Zu^  for  s  >  0  . 


Using  the  generalized  version  (7*4)  of  (7*l)>  a  generalized  version  of 
(7.5)  can  be  obtained,  namely, 


(7.o) 


Zu 


r+i 


5  Zu1*  Zu" 


0  <  r  <  1  . 


1  1*  /  r 

From  (7*2)  and  (To)>  we  see  that  £  >  u^/Zu  ,  so 


that 


„  r+s  /„  £ 

IA1  /  Z.U 


=  Z.U 


r  u 


u  ^  r  1 

—  <  ui  <  r 

Zu  /  i 


„  r 

L.M  , 


which  yields  (7 -o) . 

The  role  of  V  in  (7*2)  stems  from  the  fact  that  c^,  ...,  c^  can 
be  divided  into  two  sets  with  equal  sums  (except  for  sign).  Suppose  we 

have  tiiree  set-  of  positive  numbers  (x),  (y),  { z }  such  that 

**1  r.  n, 

7  Xi  l  y)  *  L  V  ^  U1  -  2  un  I  "  =  nl  +  "2  +  nj)  be  the 

1  1  1 
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ordered  x's,  y's,  and  z's.  Then 


JL  Ji  V  |i  i  I  o  i 

lu  ’•••’  zu  ^  b’  y  y 


By  using  arguments  analogous  to  those  for  the  case  of  two  sets,  we 


obtain 


1  r  L//r  Is  1//s 

^  Zu  >  ±  Zu  ' 

3  )  ~  J 


0  <  r  <  1,  r<s  , 


which  parallels  (7-4)..  The  extension  to  the  case  of  an  arbitrary  rimber 
of  sets  is  immediate. 

APPENDIX 

Theorem.  There  exists  a  decreasing  failure  rate  (DFR)  distribution 
r 

with  moments  -  r  ,  r  >  0. 

Proof.  (H.  Rubin).  Consider 


rr  =  r  xrf(x)dx  =  f  f(^y)  e‘yay 


r-  ^ 

_  f  P-yr 


e  ^  g(y)dy  . 


From  the  inversion  formula,  Widder  (19^1)  p.  2hl, 

1  ~  °  r  vr  i  ~ 30  r 

g(y)  =  2ni  V  e  dr  "  2 «T  /  exp^r  log  r  +  y  Jdr  * 

-DO  — » 

Transform  from  r  to  Q  by  r  =  p(y,0)  e*"  choosing  p  in  such  a 

1* 

manner  that  r  log  r  +  y  is  real.  Since 


r  log  r  +  yr  =  p[cos  e(log  p+y)  -  0  sin  r  ]  +  ip  [sin  e(log  p+y)  +  0  cos  j] 
=  a  +  hi  , 


we  choose  b  =0,  i.e., 
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(  A .  1)  log  p  =  -  0  cot  0  -  y  . 

Then 


r  log  r  +  yr  =  p[cos  d(-6  cot  0)  -  0  cos  0)  =  -  p0  esc  0 
=  -  e  ( 0  esc  0  e  ) 

=  -  e  y  A{0)  . 


To  determine  dr/d0,  note  that  from  (A- 1)  ^  ^  =  0  csc^0  -  cot  0,  so 


that 


dr  0  /  ,  10  dp 

35  *  p  35  (cos  a+i  sln  e)  +  e  dS 


=  p  esc  0  [  0  cot  0  -  1]  +  ip0  esc  0  =  c  +  di  . 


Hence 

g(y)  =  exp[-  e'yA(0)j  (c+di)  d0  =  L  +  Mi  . 

But  g(y)  is  real,  so  that  the  right-hand  side  must  be  real.  This 
implies  that  the  complex  term  vanishes,  and  that 

g(y)  =g7  exp[-  e"yA(a) ]  e"yA(a)d0 

w  -n 

=  -  /  exp[-  e’yA(e)]  e”yA( 0)d0  . 

-0 

-y 

Let  x  =  e  ,  then 

f(«)  -  »<-.  *> .  i  r  •■rt(9)*(0)« . 

-  0 

vhich  is  a  mixture  of  exponential  distributions,  and  is  automatically 
DFR. 
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